Data Processing

Data processing is the collection and manipulation of digital data to produce meaningful information. Data processing is a form of ''information processing'', which is the modification (processing) of information in any manner detectable by an observer. Data processing is distinct from ''word processing'', which is manipulation of text specifically rather than data generally.
The term "data processing", or "DP", has also been used to refer to a department within an organization responsible for the operation of data processing programs.


Data processing functions

Data processing may involve various processes, including:
* Validation – ensuring that supplied data is correct and relevant.
* Sorting – "arranging items in some sequence and/or in different sets."
* Summarization (statistical or automatic) – reducing detailed data to its main points.
* Aggregation – combining multiple pieces of data.
* Analysis – the "collection, organization, analysis, interpretation and presentation of data."
* Reporting – listing detail or summary data or computed information.
* Classification – separation of data into various categories.
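
The functions listed above can be illustrated with a minimal Python sketch. The record layout (`region`, `amount`), the validation rule, and the classification threshold are assumptions chosen for illustration only; they are not taken from any particular system.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical input records; the field names are assumptions for illustration.
records = [
    {"region": "north", "amount": 120.0},
    {"region": "south", "amount": 80.0},
    {"region": "north", "amount": -5.0},  # invalid: negative amount
    {"region": "south", "amount": 40.0},
    {"region": "east", "amount": 300.0},
]

def validate(rows):
    """Validation: keep only rows whose amount is a non-negative number."""
    return [r for r in rows
            if isinstance(r.get("amount"), (int, float)) and r["amount"] >= 0]

def sort_rows(rows):
    """Sorting: arrange rows by region, then by amount."""
    return sorted(rows, key=lambda r: (r["region"], r["amount"]))

def aggregate(rows):
    """Aggregation: combine amounts into one total per region."""
    totals = defaultdict(float)
    for r in rows:
        totals[r["region"]] += r["amount"]
    return dict(totals)

def summarize(rows):
    """Summarization: reduce the detail rows to a few summary figures."""
    amounts = [r["amount"] for r in rows]
    return {"count": len(amounts), "total": sum(amounts), "mean": mean(amounts)}

def classify(row, threshold=100.0):
    """Classification: place a row in a 'large' or 'small' category."""
    return "large" if row["amount"] >= threshold else "small"

def report(rows):
    """Reporting: one text line per row, followed by a summary line."""
    lines = [f"{r['region']:<6} {r['amount']:>8.2f} {classify(r)}" for r in rows]
    s = summarize(rows)
    lines.append(f"rows={s['count']} total={s['total']:.2f} mean={s['mean']:.2f}")
    return lines

clean = sort_rows(validate(records))
totals = aggregate(clean)
report_lines = report(clean)
```

Each function here is deliberately small and independent, so the steps can be chained in whatever order a given application needs; in practice validation usually comes first so that later steps only see usable data.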


History

The history of the United States Census Bureau illustrates the evolution of data processing from manual through electronic procedures.


Manual data processing

Although widespread use of the term ''data processing'' dates only from the 1950s, data processing functions have been performed manually for millennia. For example, bookkeeping involves functions such as posting transactions and producing reports like the balance sheet and the cash flow statement. Completely manual methods were augmented by the application of mechanical or electronic calculators. A person whose job was to perform calculations manually or using a calculator was called a "computer."

The 1890 United States Census schedule was the first to gather data by individual rather than household. A number of questions could be answered by making a check in the appropriate box on the form. From 1850 to 1880 the Census Bureau employed "a system of tallying, which, by reason of the increasing number of combinations of classifications required, became increasingly complex. Only a limited number of combinations could be recorded in one tally, so it was necessary to handle the schedules 5 or 6 times, for as many independent tallies." "It took over 7 years to publish the results of the 1880 census" using manual processing methods.


Automatic data processing

The term ''automatic data processing'' was applied to operations performed by means of unit record equipment, such as Herman Hollerith's application of punched card equipment for the 1890 United States Census. "Using Hollerith's punchcard equipment, the Census Office was able to complete tabulating most of the 1890 census data in 2 to 3 years, compared with 7 to 8 years for the 1880 census. It is estimated that using Hollerith's system saved some $5 million in processing costs" in 1890 dollars even though there were twice as many questions as in 1880.


Electronic data processing

Computerized data processing, or electronic data processing, represents a later development, with a computer used instead of several independent pieces of equipment. The Census Bureau first made limited use of electronic computers for the 1950 United States Census, using a UNIVAC I system, delivered in 1952.


Other developments

The term ''data processing'' has mostly been subsumed by the more general term ''information technology'' (IT). The older term "data processing" is suggestive of older technologies. For example, in 1996 the Data Processing Management Association (DPMA) changed its name to the Association of Information Technology Professionals. Nevertheless, the terms are approximately synonymous.


Applications


Commercial data processing

Commercial data processing involves a large volume of input data, relatively few computational operations, and a large volume of output. For example, an insurance company needs to keep records on tens or hundreds of thousands of policies, print and mail bills, and receive and post payments.
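
The insurance example can be sketched in a few lines of Python. This is only an illustration of the pattern described above (many records, very little arithmetic per record, one output per record); the `Policy` fields, the helper names, and the sample data are all hypothetical.

```python
from dataclasses import dataclass
from decimal import Decimal

@dataclass
class Policy:
    # Hypothetical record layout for one insurance policy.
    policy_id: str
    holder: str
    monthly_premium: Decimal
    balance_due: Decimal = Decimal("0")

def bill(policy):
    """Add one month's premium to the balance and return a bill line."""
    policy.balance_due += policy.monthly_premium
    return f"{policy.policy_id} {policy.holder}: amount due {policy.balance_due:.2f}"

def post_payment(policy, amount):
    """Post a received payment against the policy's balance."""
    policy.balance_due -= amount

# Two sample policies stand in for the tens of thousands a real insurer holds.
policies = {
    "P-001": Policy("P-001", "A. Holder", Decimal("50.00")),
    "P-002": Policy("P-002", "B. Holder", Decimal("75.50")),
}

# Large-volume output: one bill per policy, each needing a single addition.
bills = [bill(p) for p in policies.values()]

# Receiving and posting a payment is likewise a single subtraction.
post_payment(policies["P-001"], Decimal("50.00"))
```

`Decimal` is used instead of `float` so that currency amounts are not subject to binary rounding error, which is a common choice for monetary values.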


Data analysis

In science and engineering, the terms ''data processing'' and ''information systems'' are considered too broad, and the term ''data processing'' is typically reserved for the initial stage, which is followed by ''data analysis'' as the second stage of the overall data handling. Data analysis uses specialized algorithms and statistical calculations that are less often observed in a typical general business environment. For data analysis, software suites like SPSS or SAS, or their free counterparts such as DAP, gretl, or PSPP, are often used.
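
The two-stage split between processing and analysis might look as follows in Python. This is a minimal sketch using only the standard library rather than any of the packages named above; the raw readings and the function names `process` and `analyze` are invented for illustration.

```python
from statistics import mean, stdev

# Hypothetical raw instrument readings, including unusable entries.
raw = ["12.5", " 13.1", "n/a", "12.9", "", "13.5"]

def process(readings):
    """Stage 1 (data processing): parse raw strings and drop unusable entries."""
    values = []
    for text in readings:
        try:
            values.append(float(text.strip()))
        except ValueError:
            # Skip entries that cannot be parsed as a number.
            continue
    return values

def analyze(values):
    """Stage 2 (data analysis): compute descriptive statistics."""
    return {"n": len(values), "mean": mean(values), "stdev": stdev(values)}

values = process(raw)
result = analyze(values)
```

Keeping the stages separate means the analysis step can assume clean numeric input, and the processing rules can change without touching the statistical code.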


See also

* Data processing system
* Big data
* Computation
* Decision-making software
* Information and communications technology
* Information technology
* Computer science


Further reading

* Bourque, Linda B.; Clark, Virginia A. (1992). ''Processing Data: The Survey Example''. (Quantitative Applications in the Social Sciences, no. 07-085). Sage Publications.
* Levy, Joseph (1967). ''Punched Card Data Processing''. McGraw-Hill Book Company.